A mathematical model for networks with 
structures in the mesoscale 



Regino Criado * Julio Flores J Alejandro Garcia del Amo ^ 
Jesus Gomez-Gardenes § and Miguel Romance ^ 
Department of Applied Mathematics 
Rey Juan Carlos University 
28933 Mostoles (Madrid), Spain 



Abstract 

The new concept of multilevel network is introduced in order to em- 
body some topological properties of complex systems with structures in the 
mesoscale which are not completely captured by the classical models. This 
new model, which generalizes the hyper-network and hyper-structure models, 
fits perfectly with several real-life complex systems, including social and public 
transportation networks. We present an analysis of the structural properties 
of the multilevel network, including the clustering and the metric structures. 
Some analytical relationships amongst the efficiency and clustering coefficient 
of this new model and the corresponding parameters of the underlying net- 
work are obtained. Finally some random models for multilevel networks are 
given to illustrate how different multilevel structures can produce similar un- 
derlying networks and therefore that the mesoscale structure should be taken 
into account in many applications. 



1 Introduction 

During the last years the scientific community has shown that many relevant prop- 
erties of communication systems, social networks and other biological and techno- 
logical systems may be described in terms of complex network properties, including 
structural and dynamical properties and the interplay between them (see, for ex- 
ample, [1], [2], [5], [H] or [16] and the references therein). The emergence of these 
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appealing results has forged the Complex Networks Analysis as an attractive and 
multidisciplinary branch of research that includes topics of sociology (social net- 
works), biological sciences (including metabolic and protein networks, genetic regu- 
latory network and many others), neuro- sciences (neural interaction networks, cor- 
tical networks) engineering (phone call networks, computers in telecommunication 
networks, Internet, the World Wide Web), theoretical physics and applied mathe- 
matics. 

The working objects of the Complex Networks Analysis are graphs (which are 
called complex networks) which occasionally prove unable to capture the details 
present in some real-life problems. One example of this situation can be found in 
sociology. Social networks analysis is used in the social and behavioral sciences, as 
well as in economics, marketing, and industrial engineering ([I?]), but some questions 
related to the structure of social networks have been not understood properly. From 
the fact that a social network can be understood as a set of people or groups of people 
with some pattern of contacts or interactions between them ([I5],[l7]) - think of the 
Facebook network - it might seem that all the connections or social relationships 
between the members of that network take place at the same level. But the real 
situation is far from this. The real relationships amongst the members of a social 
network take place (mostly) inside of different groups (levels) and therefore it cannot 
be modeled properly if only the natural local-scale point of view used in classic 
complex network models is taken into account. This phenomenon is related to 
the fact that several real-life complex systems have a community structure [S] that 
takes place in a scale that lies between the local and the global one and that gives 
information about the structure-functionality relationship. The problem of finding 
and analyzing the community structure of a complex network is one of the hot spots 
of the complex network analysis, since these meso-scale structures reveal valuable 
information about roles of groups of nodes |12] . 

The main goal of this paper is to present the concept of multilevel network as a 
new model useful to analyze with detail complex systems that has some structures 
in the mesoscale level. As we pointed out before, this kind of networks are present 
in many real-life situations, not only in social networks, but also in communication 
networks and all the other examples that exhibit a community structure. The in- 
troduction of a sharp mathematical model that fits this new structures may prove 
crucial to properly analyze the dynamics that take place in these complex systems 
which are far from being completely understood. An example of these phenomena 
is the dissemination of culture in social networks by using the Axelrod Model that 
has been recently studied by using meso-scale networks [9]. Yet the study of other 
meso-scaled complex networks and their dynamics is one of the scientific challenges 
of the nowadays complex network analysis. There are other mathematical models 
that consider non-local structures such as the hypergraphs (or hypernetworks) [1] 
or the hyper-structures [6] but they are not able to combine the local scale with 
the global and the mesoscale structures of the system. For example, if we want to 
model how a rumor is spread within a social network, it is necessary to have in mind 
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not only that different groups are linked only through some of their members but 
also that two people who know the same person don't have necessarily to know each 
other, and, in fact, these two people may belong to different groups or levels. If we 
try to model this situation with hyper-networks, then we will only be taking into 
account the social groups and not the actual relationships between their members; 
in contrast if we use the hyper-structure model, then we won't possibly know to 
which social group each contact between two nodes belongs. 

After multilevel networks are defined, we present a structural analysis of these 
objects in terms of the clustering coefficient and the metric structure which must 
be properly defined in this setting. We will compare these parameters with the 
corresponding ones for the underlying projection networks and we will present the 
analytical relationships between them. Despite the fact that there are sharp connec- 
tions between the structural properties of the multilevel network and the properties 
of their projection and layer (or slice) networks, we will present some examples that 
exhibit the deep differences between the classic complex network approach and this 
new point of view. 

Finally, in section 3, we will give several growing models to produce multilevel 
random networks, inspired by the Barabasi- Albert preferential attachment model of 
complex networks [3]. These methods are inspired by the bipartite networks models 
introduced by Ramasco et al. [H] and by M. Peltomaki and M. Alava [13], but in 
this case we have to take into account the meso-scale nature of the multilevel struc- 
ture. The use of growing random models is natural since many of the examples in 
real life that can be modeled by multilevel networks, such as social or transporta- 
tion networks, are dynamic models whose structure grow in time by addition of new 
nodes, links and layers. By using these models we perform several random testings 
that show that we can get very different multilevel networks (i.e. with very different 
meso-scale structure) which still have similar local and global structures when con- 
sidered as classic complex network. For example, we exhibit a multilevel network 
that at the meso-scale is like an Erdos-Renyi random network, but it has a scale-free 
structure if we consider it as a complex network. These random testings reinforce 
the idea that in many cases we must take into account the meso-scale structure in 
order to get deep understanding of the structure and dynamics of many complex 
systems. 

2 Multilevel Networks: A Structural Analysis 

As we pointed out in the previous section, a sort of naive approach to complex 
systems with structures in the mesoscale could suggest that hyper-networks and 
hyper-structures fit perfectly to these real-life systems. The key-point that makes 
these mathematical model not to be the best solution for systems with structures in 
the mesoscale has to do with the fact that both hyper- networks and hyper-structures 
are node-based models, while many real systems combine a node-based point of 
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view with a link-based perspective. For example, if we have a look again at a 
social network with structures in the mesoscale (such as several social groups within 
its structure), when we consider a relationship between two members of one social 
group(or several), we have to take into account not only the social groups that hold 
the members, but also the social group that holds the relationship itself, i.e., if, for 
example, there is a relationship between two people that share the same working 
group and the same sport group, we have to highlight if the relationship is due to 
sharing the same group of work or it has sport nature. This fact is not particular 
of social networks with structures in the mesoscale and a similar situation occurs, 
for example, in public transportation systems, where a link between two stations 
belonging to several transport lines can occur as a part of different lines. 

In order to avoid this node-based nature of hyper-networks and hyper-structures, 
we propose to introduce the following concept that combines the node-based with 
the link-based perspective: 

Definition 2.1 Let G = {X,E) be a network. A multilevel network is a triple 
M = (X, E, S), where S = {Si, . . . , Sp} is a family of subgraphs Sg = {Xg, Eg) of G 
such that 

G=\jSg, 

<?=! 

i.e. X = Xi U ■ ■ • U Xp and E = Ei U ■ ■ ■ U Ep. The network G is the projection 
network of M and each subgraph Sj G 5 is called a slice of the multilevel network 
M. 

This mathematical model perfectly suits social systems (as well as other complex 
systems) with structures in the mesoscale, since each social group can be understood 
as a slice graph in a multilevel network; thus we are simultaneously taking into 
account the nature of the links (i.e. relationships) and the nodes involved. 

It is easy to check that this new mathematical object extends both the classic 
complex network model and also the hyper- network model [1]. Let us point it out 
very briefly. On the one hand, if G = (X, E) is a network, then it can be easily seen 
as a multilevel network by considering M = (X, E, S), where S = {G}. 

On the other hand ii H = (X, H) is an hyper-network (i.e. X is a non-empty 
set of nodes and H = {Hi, . . . , Hp} is a family of non-empty subsets of X, each of 
them called and hyper-link of "H), then it can be seen as the multilevel set M = 
{X,En,Sn), given by 

Sn = {KH„...,KH,}, 

where Kh is the complete network obtained by linking every pair of nodes of Hj 
and 

p 

En = [jKH^. 

r=l 

By using similar arguments we can show that every hyper-structure [6] can be un- 
derstood as a particular multilevel network, by considering one slice network for 
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each hyper-link in the hyper-structure and each shce graph being a set of isolated 
nodes. 

Once we have introduced this new and novel mathematical object, we have to give 
suitable structural parameters to analyze it. We can give natural extensions of many 
of the usual tools of the complex networks' analysis, such as the clustering coefficient, 
an adjacency matrix/tensor, a natural network representation as a tripartite network 
or a geodesic structure, among many others. 

2.1 Clustering of multilevel networks 

In this subsection we extend the graph clustering coefficient introduced by Watts and 
Strogatz in [IB] to multilevel networks and we establish some relationships between 
the clustering coefficient of a multilevel network, the clustering coefficient of its slices 
and the clustering coefficient of its projection network. Recall that given a network 
G = {X, E) the clustering coefficient of a given node i is defined as 

^ number of links between the neighbors of i 

largest possible number of links between the neighbors of i 

We take a/0 to be for every number a. Thus, if we think of three people i,j and k 
with mutual relations between i and j as well as between i and k, the clustering 
coefficient of i is supposed to represent the likeliness that j and k are also related. 
The clustering coefficient of G is usually defined as the average of the clustering 
coefficients of all nodes. 

In order to give a definition os the clustering coefficient of a node in a multilevel 
network M = {X,E,S), where S = {Si, . . . , Sp} and Sq = {Xq,Eq) for every 
1 < g < p, we need to introduce some notation. For every node i E X call A/'(z) the 
set of all neighbors of the node i in the projection graph G. For every q = 1, . . . ,p 
we will denote Afq{i) = Af{i) fl Xq and Gq{i) the subgraph of the slice Sq generated 
by ^^q{i), i.e. G~q{i) = {^^g{i) ,E'q{i)) , where 

Wqii) = {{k,j} e Eq\k,j e^^q{i)}. 

Similarly we will define G{i) as the subgraph of the projection network G generated 
by Af{i), i.e. G{i) = {Af{{},E{i)), where 

m = {{k,j}eE\k,jeAfii)}. 

In addition the complete graph generated by N'q{i) will be denoted as usual by 
K^fq{i)- Note that the largest possible number of links between the nodes of Nqii) is 
precisely the number of links in Kj^^{i), i.e. 

\EqV>')\ < ^ , 

where \Afq{i)\ stands as above for the cardinality of Mqii). With the previous nota- 
tion we give the following 
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Definition 2.2 Let M = {X,E,S) be a multilevel network. The clustering coefE- 
cient of a given node z in M is defined as 

2 E 1^(^)1 
ca/(«) = — • 

9=1 

The clustering coefRcient of M is the average of all cm(0 o^^r the set of nodes. 

Notice that the clustering coefficient might have been defined differently. For 
instance we might have considered the clustering coefficient of each Sg, say Cg{i), 
and then take the average over the set {Sq}g^i. However we have opted for a 
definition that embodies the following idea: it is possible for a given node i to have 
two neighbors k,j with {i,k} G Eg, {i,j} G Eg/ with q ^ q' and such that there 
is a link {j, k} G Eg// with q" ^ q, q' . This is a natural situation when we think of 
social networks; indeed, one person i knows j from the aerobic class, i also knows k 
from her reading club while j and k know each other from the supermarket. This 
sort of situation produces more links in the subgraphs Gg{i) defined above than 
those already present in the slice Sg. Taking the alternative definition based on the 
clustering coefficients of the slices would not help to discriminate such situations; 
in contrast the definition proposed does. The following example should clarify this 
situation and justifies the restrictions of theorem 12. 4[ 

Example 2.3 If we consider the multilevel network M = {X,E,S) where X = 
{1, 2, 3, 4}, S = {Si, 5*2, 5*3} and G is as in the next figure, it is easy to check that 
ccii) = 1 for alH G X but however cs^{i) = for alH G X and g = 1, 2, 3. 



Si So S3 





G 



Figure 1: An example of the extreme difference between clustering coefficients 
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As we have seen in the last example, the clustering coefficient of the projected 
graph G and the clustering coefficient of each slice of M may be very different. 
However we are interested in establishing relations between the clustering coeffi- 
cient of the multilevel network, cm(0, and the clustering coefficient in the projected 
network, Ccii)-, if possible. That is the target of the next theorem: 

Theorem 2.4 Let M = {X, E, S) be a multilevel network and i E X . If we consider 
A{i) = {q e {1, . . . ,p} \ \J^q{i)\ < 2} and we denote 9{i) = \A{i)\, then 

cg{^) < CM{^)<PCG{^) (l + (p - 1) (^+-^^]] ■ (2.1) 



p — 9{i) \ \ p — 9{i 

Furthermore, if M is homogeneous, then 

1 



p-e{i] 



ccii) <CM{i) <cg{^. (2.2) 



Proof: 

Let us start with the left-hand inequality in f l2.ip . Using that N'q{i) C N'{i) for 
every 1 < g < p and E{i) = U^^iEg{i), we get that 

2\m\ EU\^MiW)\-^) 2\E{t)\ 



mmimi - 1) mm\m\ - 1) eu \Km\m)\ - 1 
^\m\ ^, ..... 2 1^(^)1 



< (P - ^(0) ^.r -^u.r^-^^ .^ < (^^ " ^(0) 



EU K(oi(iA/;(oi - 1) - EU mmiK 



{p-9{t))cM{t) 



smce 



J2 m^)mM - 1) = E W)mM - 1) < iP-emiAfmmi)] - 1)) . 

q=l qtA(i) 

In order to prove the remaining inequality notice that the number of links in 
the complete graph built up on the set of nodes A/'(i) never exceeds the sum over 
g = 1, . . . ,p of the numbers of links of the complete graphs built up on the sets of 
nodes A/'g(i) plus all the links possibly obtained by joining nodes of Mk{i) to nodes 
of Afj{i) where k and j are chosen among all possible pairs {k,j) with 1 < k < j < p, 
i.e. 



I E \-^M{\^M - 1) + E (2-3) 



2 - 2 

9=1 k<j 



where the last sum has (2) terms. Notice also that we can always assume, by 
rearranging if necessary, that |A/'g(i)| < \J\fq+i{i)\ for 1 < g < p — 1. In addition to 
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this, observe finally that 

5^ |Ar,(z)| (|A4(z)| - 1) < E - 1)1^.(01(9 - 1 

<(p-i)E(iA/;(oi-i)iA/;(oi- 

9=1 

Thus, since Eq{i) C E{i) for every 1 < g < p and by using fl2.3p 



(2.4) 



e;=i ia/;(oi(ia/;(oi - 1) - K(oi(K(oi - 1) 



Notice that ii 1 < k < j < p, then 

immM = (iA4(oi -1 + 1) iA/;-(oi < - 1) ia/;-(oi + 

Now if we combine the last formula with fl2.5p . we get that 

2E,<j\W)\\m)\ 



CM{i) < pccii) 1 + 



< pcGii) 1 + 2(p - 1) + 2=jr 



1-^.(01 (K(OI-i) 

It is easy to check that if g G A{i), then \Nq{i)\ (|A/'g(i)| — 1) = 0, which makes that 

Y.WM {WM-^)= E 1-^.(01 (1-^.(01 -i)> E i-^'/(oi- 

q=i qiA{i) qtA{i) 



Hence 



E;=iIA/;(OI (|A4(0I-i) - E,^A»l-AA,W 

^(0 



< 1 + 



2(p-^(z))^ 
8 



since if g G A{i), then < 1, while if g ^ A{i), then \Mq{i)\ > 2. Finally, if we 

use the last upper bound, we obtain that 



CM{^<pcGii) l + 2(p-l)+2, 



<PCg{i) ( 1 + - 1) (4 + 



'<?= 




p-e{t] 

On the other hand, if M is homogeneous, then the same techniques as before 
shows that cg(z) < — 9{i))cM{i)- Since M is homogeneous, then Mq{i) = J^{i) 
and Eq{i) C E{i) for every 1 < q < p. Therefore we conclude that 



e:=i ia/;(oi(ia/;(oi-i) 



2p|i^(^)| ^ 2|E(2)| ^ 



□ 



Remark 2.5 The definition of 6'(z) shows that the estimation in the previous the- 
orem gets coarser as the number of slices for which i has strictly less than two 
neighbors (possibly zero) increases; in a "normal" scenario this situation will not 
occur and the estimation will involve a polynomial of degree two in p. Anyway, since 
< 0{i) < p and the function f{x) = x/{p — x) is increasing in [0,p), then the last 
result ensures that 

cm(0 < gip) cg(«), 

where g{p) is of order with a G [2,3], but still it is not obvious whether the 
exponent 2 can be lowered. 

As it was said above there are other possible definitions for the clustering of a 
node in a multilevel network, based on the clusterings of that node in the slices, 
and then it is natural to have then related by means of estimations. However a 
remark is in order. Notice that it is possible to construct a network whose slices 
have clustering coefficients zero for every node while the clustering coefficient of the 
projection network is 1 for every node. Indeed, if we consider the example 12.31 as our 
multilevel network M, then it is straightforward to check that for a given node i, 
its clustering coefficient in each of the slices, Cq{i), is zero, and hence every convex 
combination of them, while the clustering coefficient cjv/(z) is 1. This means that 
we cannot expect to estimate cm(0 from convex combinations of {cq{i)}g=i. Still 
it makes sense to give a "slicewise" definition for the clustering coefficient of the 
node 2 in a multilevel network M and find estimations by means of the clustering 
already defined. Again some notation is required. Call 

Nqii) = { j G X I j is a neighbor of i in Sq} 
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and Gg{i) = {J\f*{i), Eg{i)), where 

E,{^) = {{k,J}EE,\k,JE^^;i^)} 

Note that Gg{i) is the subgraph of the shce Sg generated by Mgii). Thus the 
following definition is proposed. 

Definition 2.6 Let M = {X,E,S) be a multilevel network and z be a given node 
of M. The slice clustering coefficient of i is defined as 



q=l 



Notice that Gg{i) is a subgraph of Gg{i) as was illustrated in the example above. 
Accordingly Mg{i) C N'q{i) and hence the largest possible number of links between 
neighbours of i in the g-slice, cannot exceed the corresponding largest possible num- 
ber of links between neighbours of i in Afq{i). As above we have the relation 



9=1 k<j 



where the last sum has (2) terms. Also we can assume, after rearrangement, that 
\-^q{'i)\ ^ \J^q+i{i)\ for all 1 < g < p. Thus by mimicking the proof of Theorem 12. 4[ 
we can get a relationship between the clustering coefficient in the projected network 
cg(z) and the slice clustering coefficient CQ{i). 

Theorem 2.7 Let M = {X,E,S) be a multilevel network and i be a given node of 
M. Call A{t) = {q e {l,...,p}\\Af;i{}\ <2} and e{i) = \A{t)\. Then 

si 



P 



Notice that there are other possible variations for the definition of the slice 
clustering coefficient of a node i. For example, the average over the clustering 



coefficients Cg{i) of the slides: 



i ^ 



1 ^ 



The relation between this definition and definition 12.61 

p mm — cAn) < cMi) < p max — cAn) 

is easily derived from the following lemma. 
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Lemma 2.8 // ai, a2, . . . , a„ G [0, oo) and bi,b2, . . . ,bn G (0, oo), then 

Er=i " ^"=1 " ^r=i tr ' 

It is straightforward since 



□ 

The following examples illustrate the behavior of the different concepts of clus- 
tering previously introduced. As it can be seen, if each one is separately evaluated 
one gets extremely different values. 

Example 2.9 Consider M = {X, E, S) with X = {1,2, . . . ,n} and the shces = 

({1, 2, 3}, {{1,2}, {1,3}, {2, 3}}) and ^2 = ({1,2,..., n}, {{1,2}, {1, 3},..., {l,n}, {2, 3}}). 

In this case 

^1/1 1 \ 1 

''^^ ~ 2 Vl ^ (n-l)(n-2)/2y' ^ 2 

while 

^ l + (n-l)(n-2)/2 

We see that the two local clustering coefficients dramatically differ for n ^ 1. 

On the other hand, consider M = {X,E,S) with X = {1, 2, . . . , n^}, S = 
{Si, S2,..., Sn}, S, = (X, E,), El = {{1,3} ■.l<i<3< n"}, E, = {{1, 2}, . . . {l,n}, {3,3 + 
1}} if 2 < J < n - 1, E„ = {{1, 2}, ... {1, n], {2, n]]. Then 



1 - l)(n2 -2)/2 



^^^^^ n I^(n2 - l)(n2 -2)/2 ^ (n- l)(n-2)/2 ^ ' - l)(n - 2)/2 

= O 



n 



while 



(n2-l)(n2-2)/2 + l + ... + l 



""^^^^ (n2 - l)(n2 - 2)/2 + (n - l)(n - 2)/2 + . . . + (n - l)(n - 2)/2 ^ ' 
Also in this case the two local clustering coefficients are very different for ri ^ 1. 

Example 2.10 Consider M = {X,E,S), where the shce Sj = {Xj,Ej) is defined 
by Xj = {1,3 + 1,3 + 2} and E, = {{l,j + l},{l,j + 2}, {3 + 1,3 +2}} for 
1 < J < n - 2, Xn-i = {1, 2, n}, E^-i = {{1, 2}, {1, n}, {2, n}}. Then cg(l) = 1 
while cg(1) = 0{l/n). 
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Example 2.11 Now consider M = {X,E,S) with Xj = X = {l,2,...,n} and 
Ej = {{1, k} : 2 < k < n} U {{j + 1, k} : 2 < k < n, k J + 1} for 1 < J < n - 1. 
Then cg(1) = 1 while cg(l) = 0(l/n). 

Example 2.12 Now consider M = {X,E,S) with Xj = X = {l,2,...,n} and 
E^ = {{_1, j + {1, j+2}, j+2}} for 1 < J < n-2, = {{1, 2}, {1, n}, {2, n}}. 
Then c^(l) = 1 while cm(1) = 0{l/n^). 

Example 2.13 Consider M = {X,E,S) with Xj = X = {1, 2, . . . , 2ra}, A = 
{{l,k} ■.2<k<n}U {{k,l} ■.n+l<k<l< 2n}, Ej = AU{{j + l,j + 2}} for 
1 < J 4^1-2, = AU{{n,2}}, En = {{l,k} : 2 < k < 2n}. Then cm(1) ~ 1/4 
while cg(l) = 0(1/722). 

Example 2.14 Consider M = {X, E, S) with Xjj = X = {1, 2, . . . , n} and = 
{{i, j}} for 1 < z < J < n. Then cg(l) = 0, cm(1) = 0(1/^2) and cg(1) = 1. 

Example 2.15 Consider M = {X,E,S) with X = {l,2,...,n}, X^ = {1,2,3}, 
with El = {{1,2}, {1,3}, {2, 3}} and Xj = {l,j + 2} with Ej = {{l,j + 2}} for 
j = 2,...,n-2. 
Then 

'°W- (n-l)(n-2) =Q'^/"^) 

while 

1 + + --- + ^ 

Ca/ 1 = = 1. 

^ 1 + + --- + 

We conclude this section with a final observation. All what has been said refers 
to local clustering coefficients. The aggregation of the local quantities into a global 
one, or clustering of M, can naturally be considered. We left the details to the 
reader. 

2.2 Metric Structure: Distances and Efficiency 

If we want to introduce metric tools in a multilevel network M = {X, E,S), we first 
have to give the notion of path and length. A path u in M = (X, E, S) is a set of 
the form u = {{ii, ■ ■ ■ ,iq), {Si, ■ ■ ■ , Sq)} such that 

(i) ■ ■ ■ , ig) is a path in {X, E), 

(a) (5*1, ■ ■ ■ , Sq) is a sequence of slice graphs Si, - ■ ■ , Sq E S, 

(in) For every 1 < j < g, we have that ij is an edge in the slice graph Sj. 

By using this concept we can introduce a metric structure in a multilevel graph 
M = (X, E, S) as follows. 
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Definition 2.16 Let M = {X,E,S) be a multilevel network, /3 > fixed and 
u = {{ii, ■ ■ ■ ,ig), {Si, ■ ■ ■ , Sg)} be a path in M. The length of u is the nonnegative 



value 



where 



j=2 



A(j) 



1 if Sj ^ Sj_i, 
otherwhise. 



The distance in M between two nodes i,j & X is the minimal length among all 
possible paths in M from i to j. 

If we take /3 = 0, the previous definition gives the natural metric in the projection 
graph, while if /3 > 0, we introduce new metrics that take into account, not only the 
global structure of the projection network, but also the interplay between the slice 
networks, that help to model the multi-scale nature of real-life social networks. 

One way to calculate the distance matrix, that is a matrix A = (Xij) that contains 
in each position Ajj the distance in M between vertices i and j in X, is to consider 
an auxiliary graph in the following way. Every vertex of the multilevel network M 
is represented by a vertex in the auxiliary graph and, if a vertex in M belongs to 
two or more slice graphs in M, then we duplicated it as many times as the number 
of slide graphs it belongs to. Every edge of E is an edge in the auxiliary graph and 
there is one more (weighted) edge for each vertex duplication between the duplicated 
vertex and the original one. The distance between a duplicated vertex j and another 
vertex k is djk = min{dj'k, dj"k}, where j' and j" are the duplications of vertex j in 
the auxiliary graph. 

Once we have defined a distance in a multilevel network M = {X,E,S), it is 
easy to generalize the concept of efficiency given in [6] to this structure: 

Definition 2.17 Let M = {X,E,S) a multilevel network. The efficiency of M is 

where n is the number of vertices in X and dMihj) is the distance in M between 
vertices i and j. 

It is quite natural to try to establish comparisons between the efficiency E{M) 
of a multilevel graph M = {X, E, S) as it was just defined before and the efficiency 
E{G) of the underlying graph as a classic complex network, as it was introduced in 
[TU] . The actual analytical result that connects these two parameters are given in 
the following theorem. 

Theorem 2.18 Let M = {X,E,S) a multilevel network, and G = {X,E). Then 

^ E{G) < E{M) < E{G), (2.6) 
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where E{G) is the efficiency of the projection network G and E{M) is the efficiency 
of the multilevel network M. 

Proof: 

Let i,j E X and u he a path in M from i to j. We denote by dM{i,j) the distance 
between i,j in the multilevel network M and by dG{i,j) the distance between i,j 
in the network G. In a similar way, we denote by ^m{^) its length in the multilevel 
network M and by ^g(i^) its length in the network G. 

It is clear that dM{i,j) > dciiyj)- On the other hand, since A(-) < 1 then 

i=2 

< £gM + /3(^gM - 1) = + 1) - /3 < ^g(c^)(/3 + 1), 

Hence, since it is clear that dcii,]) = YmYl{^G{,^)\^ is a path in M from i to j}, we 
get that 

-dM{i,j) < dciij) 



f3 + l 

and therefore, by combining the last expressions with the reverse bound obtained 
before we obtain that 

11 1 1 

< < 



l3 + ldG{iJ) dM{iJ) dciij) 
which gives the result. 

□ 

Remark 2.19 The last bounds are sharp simply by making /3 — y O"*". 

We are equally interested in obtaining estimations of the efficiency of a multilevel 
graph in terms of the efficiencies of its slices. Precisely we obtain the following result. 

Theorem 2.20 IfM = {X, E, S) is a multilevel network such that S = {Si, . . . , Sp} 
and for every 1 < q < p we denote Sq = {Xq, Eq), then 

^-^II^^EiSq) < EiM), (2.7) 

where \Xq\ is the number of nodes of Sq and \X\ is the number of nodes of M . 
Proof: 

If we take 1 < q < p, since Xg C X and Eq C E, then for every i,j G Xg and 
every path u in 5*^ from i to j in 5*^, we get that isq{^) = ^m{^) > dM{i,j). Then 
dsAhj) >dM{i,j)- Therefore 
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which makes that 



E(S, 



< 



\Xg\(\X, 



I y -1- 



_ 1^1(1^1 -1) 

j) |X,|(|X,|-1) 



E{M). 



□ 



Remark 2.21 There is a reverse inequahty that can be estabhshed when M = 
(X, E, S) is an homogeneous multilevel network (i.e., a multilevel network such that 
Xq = X for every layer = {Xg,Eg) G 5"). Indeed, in this case the following 
relation is easily obtained 

E{Sq) < E{M) < E{G) < i\X\ - l)EiSg). (2.8) 



3 Some models for multilevel random networks 

In this final section we present some models to produce multilevel random networks, 
inspired by the Barabasi-Albert preferential attachment model of complex networks 
[3] and several bipartite networks models such as the collaboration network model 
proposed by Ramasco et al. [H] or the sublineal preferential attachment bipartite 
model introduced by M. Peltomaki and M.Alava [13]. We propose growing ran- 
dom models since many of the real-life examples that can be modeled by multilevel 
networks, such as social or transportation networks, are dynamic models and their 
structure grow in time by addition of new nodes, links and layers. The presented 
randomized models are determined by the following rules: 

(i) Model parameters. Our models have three main parameters: iV, m and Pnew 
We set X G N as the number of nodes in the multilevel network while 2 < 
m < N account of the number of nodes in each layer {i.e. if we take m = 2, 
we recover the Barabasi-Albert model [3]). In our model m will be fixed, but 
it can also be replaced by any other non-negative integer random variable if 
we want to produce more general models. Finally, we set Pnew ^ (0, 1] as the 
probability of joining a new node to the multilevel network. 

(ii) Initial conditions. We start with a seed multilevel network made of one single 
layer 5*0 of m nodes that are linked all to all, {i.e. Sq = Km)- We can replace 
the all-to-all structure by any other structure (such as a scale free or a Erdos- 
Renyi network), but the results obtained are statistically equivalent. 

(Hi) Layer composition. At each time step t, a new layer St of m nodes is added 
to the multilevel network. In order to determine St we have to give its nodes 
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and links. We start by choosing randomly an existing node of the multilevel 
network proportionally to its degree [preferential election). Therefore if at 
step t — 1, the set of nodes of the multilevel network is {vi, . . . ,Vn}, and ki 
denotes the degree of node Vi at time t — 1 in the projection network, then we 
choose the node Vi randomly and independently with probability 

ki 

Pi 

The chosen node will be the first node of the new layer St and we call it 
coordinator of the layer. As an alternative, we could choose the coordinator 
proportionally to the number of layers that it belongs to but, in this case, the 
model will be similar to the collaboration network model proposed by Ramasco 
em et al. [H] . Each of the remaining m — 1 nodes of St will be a new node 
with probability p^ew and an existing node with probability (1 — Pnew)- If "we 
have to add an already existing node, we will uniformly and independently 
choose it at random. Note that we can replace the uniform random selection 
by other random procedure (such as preferential selection), but the random 
tests done suggest that the multilevel network obtained have statistically the 
same structural properties when is large enough (A^ > 10'^). Anyway, we 
should have chosen m nodes Wi, . . . ,■5^ ("^i is the coordinator node) that will 
belong to the new layer St. 

(iv) Layer inner-structure. Once we have fixed the nodes Wi, of the new layer 
St., we have to give its links. First, we link all the nodes to the coordinator node 
in order to warrant that the new layer is connected. In addition to this, we set 
new links between each pair of nodes, say Vi and Vj (with 1 < i 7^ j < m) by 
using a random linking probability pij that we will present later. At the end 
of this step we have defined completely the new layer St. 

(v) Finally, we repeat steps (Hi) and (iv) until the number of nodes of the multi- 
level network is at least A^. 

These rules define a family of growing models for multilevel random networks 
that cover a wide range of different networks with different meso-scale structure, 
simply by changing the linking strategy, Pij, at each step. The simplest choice for 
this strategy is pij = 1, i.e. an all-to all strategy, thus linking all the nodes in the 
new slice. In this way, we recover a model similar to the Ramasco et al. model [Hj, 
that can be used to produce hyper-networks with a meso-scale structure similar to 
that found in scientific collaboration systems. 

In this paper we will focus on the following three different linking strategies: 

1. Erdos-Renyi type strategy (Model I). We fix a value Pij = punk G [0, 1] so that 
we add each link {vi.Vj} randomly and independently with probability punk, 
for every 2 < i ^ j < m. Note that this strategy does not take into account 
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the meso-scale structure of the multilevel network, since the existence of the 
link {vi,Vj} is independent of the nodes Vi,'Vj in other slices of the multilevel 
network. Clearly, this is a toy model inspired by the classic Erdos-Renyi model. 

2. Assortative linking strategy (Model II). For every 2 < i ^ j < m, we add 
randomly the link {vi, vj} proportionally to the number of common slices that 
hold simultaneously Vi and Vj. Hence if we denote by Qij the number of slices 
that hold simultaneously Vi and Vj at time step t (including 5*^) and by 
the number of slices that hold Vi at time step t (also including 5*^), thus the 
probability of linking node Vi with node Vi is given by 

Pij 



Qi + Qj 



for every 2 < i ^ j < m. The heuristic behind this strategy comes from 
social networks, since in this type of networks the relationships in a new social 
group are correlated with the previous relationships between the actors in 
other social groups [T7]. Hence, on the one hand, if two actors that belong 
to the new social group coincide in many (previous) social groups, then the 
probability of linking in this new social group is big and ,on the other hand, 
when two new actors join their first social group, the probability of establishing 
a relationship between them it is also high. 

3. Disassortative linking strategy (Model III). If we take 2 < i ^ j < m, the 
probability of linking Vi with Vj in the slice St is inversely proportional to the 
number of slices that hold simultaneously Vi and Vj, i.e. if we denote by pij 
such probability, then 

2Qij 
Pij = 1 

qi + Qj 

by using the same notation as in the assortative strategy {Model 11). Now, the 
heuristic inspiring this strategy lies in some transportation networks, such as 
the airline networks, where the links in a new line try to connect nodes that are 
not connected in previous lines. By using the last expression, the newcomer 
nodes of the slice will be linked to old nodes that belong to many slices with 
high probability, while the newcomers will not be linked between them also 
with high probability. This strategy prevents the multiple reiteration of the 
same link in different slices and stresses the dissasortativity between newcomer 
nodes and nodes that belong to many slices. 

Once we have introduced the three linking strategies, we now present some numerical 
results to shows that, despite the fact that the above tree rules produce hyper-graphs 
with similar projection networks, the structures found in the mesoscale can be quite 
different. 

Let us start by showing in figure[2]the degree distribution, P{k), of the projection 
network. It is clear that the degree distribution of the projection network obtained 
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Figure 2: Random testing of the degree distribution of the projection graph in the 
randomized multilevel models. Model I on the left, the model II is centered and 
Model III is on the right, each of them for m = 5, 20, 50 nodes per slice. The 
distribution of each type of network and size of slice is computed for 100 random 
networks of 5000 nodes each one. 

in three models display heavy-tailed profiles, and this fact is more relevant if the 
number of nodes of each slice is low, since in this case the Big Number Law's make 
that the degree distribution behaves as a power-law, P{k) ~ k~'^ with 2 < 7 < 3. 
Note that a discretizacion effect appears in all the distribution presented in figure |2] 
that makes that the degree distribution are oscillatory. This is due to the fact that 
each slice has the same number of nodes, and all of them are linked to the coordinator 
node, which makes that the degree of the nodes are concentrated in some values 
modulus m. In order to avoid this effect, we could modify step (i) replacing m by 
a non-negative integer random variable that prevents that the degree of each node 
increases modulo m. This discretization effect is present in the three models and 
increases when the size of the slices is large, but it is more persistent in Model III, 
since in this case the dissasortative character of the model stresses this phenomenon. 

In addition to the degree distribution of the projection network, in figure [3] we 
show the distribution P{q) of the number of slices that hold each node (we call 
it hyper-degree distribution for consistency with the hyper-graph theory notation). 
Obviously, from this figure we observe that the hyper-degree distribution is only 
depending on the number of nodes m of each slide and it is not correlated with the 
linking model used to construct the network. On the other hand, the behavior of 
P{q) depends quite strongly on the particular value of m. In particular, while P{q) 
can be approximated by a truncated power law for m = 5 (figure [31 A), it presents 



18 



Figure 3: Random testing of the hyper-degree distribution of the multilevel models 
(in log scale) for different number of nodes m in each slice for 100 random networks 
of 5000 nodes each one. The case m = 5 is on the left, m = 20 is centered and 
m = 50 is on the right, combined with some numerical approximation in each case. 




Figure 4: Random testing of the average degree distribution of each slice for 100 
random networks of 5000 nodes each one in the randomized multilevel models I, II 
and III. The case m = 50 is in panel A (on the left), while the case m = 100 is in 
panel B (on the right), both for models I, II and III. 

an exponential decay in the cases m = 20 and m = 50 (figures OB and[3lC). 

Figures [2] and [3] has shown that either the structure of the projection network and 
hyper-degree distribution are quite similar for the three random models presented. 
However, they are very different if we analyze their mesoscale structure, as figure 
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H] shows. This figure shows, for m = 50 and m = 100, the probabihty of finding a 
node connected to hk nodes within a randomly chosen shce of the network. It is 
straightforward to check that since the Erdos-Renyi type strategy is used in Model I, 
then the resulting distribution P{hk) is a binomial distribution with parameters m 
and piink (asymptotically a Poisson distribution). However, note that in figure |2] 
we found that the degree distribution of the projection network is scale- free. Corre- 
spondingly, Models II and III display totally different mesoscale properties as shown 
by their corresponding distributions P{hk). On one hand in Model II we observe 
that P{hk) tend to accumulate around the maximum possible value of hk, m, due to 
the assortative character of the linking strategy used. Conversely, in Model III the 
association between the nodes belonging to the same slice is much lower due to the 
disassortativity of the model. Again, while both Models II and III gives two differ- 
ent bimodal averaged distributions P{hk), the corresponding distribution for their 
projection networks are also of scale-free type. Therefore, these results illustrate 
that many different multilevel networks (with different mesoscale structures) can 
produce similar projection networks and therefore that we should take into account 
the mesoscale structure in order to give sharper models of many real-life problems. 

Acknowledgements 

The authors of this work have been partially supported by the Spanish Government 
Project MTM2009-13848. 

References 

[1] R. Albert and A.L. Barabasi, Statistical mechanics of complex networks, 
Rev. Mod. Phys. 74 (2002), 47-97. 

[2] A.L. Barabasi, Linked: The new science of networks, Perseus Publishing, 
Cambridge, Massachusetts, 2002. 

[3] A.L. Barabasi and R.Albert, Emergence of scaling in random networks. 
Science 286, 509-512. 

[4] C. Berge, Hyper- graphs . Combinatorics of Finite Sets, North-Holland, 1989. 

[5] S. BoccALETTi, V. Latora, Y.Moreno, M.Chavez and D.U.Hwang, 

Complex Networks: Structure and dynamics. Physics Reports 424 (2006), 175- 
308. 

[6] R. Criado, M. Romance and M. Vela-Perez, Hyper structures, a new ap- 
proach to complex systems. Int. J. Bif. Chaos 20 (2010), 877-883. 



20 



[7] E. Estrada and J. A. Rodriguez- Velazquez, Subgraph centrality and clus- 
tering in complex hyper-networks, Physica A (2006). 

[8] M. GiRVAN AND M.E.J. Newman, Community structure in social and biolog- 
ical networks, Proc. Natl. Acad. Sci. USA 99 (2002), 7821. 

[9] B. GUERRA, J. PONCELA J. GOMEZ-GaRDENES, V. LATORA AND 

Y. Moreno, Dynamical organization towards consensus in Axelrod model on 
complex networks, Phys. Rev. E 81 (2010), 056105. 

[10] V. Latora, M.Marchiori, Efficient Behavior of Small-World Networks, 
Phys. Rev. Lett. 87 (2001), 198701. 

[11] M.E.J. Newman, The structure and function of complex networks, SIAM Re- 
view 45 (2003), 167-256. 

[12] M.E.J. Newman , Detecting community structure in networks, Eur. Phys. J. B 
38 (2004), 321-330. 

[13] M. Peltomaki and M. Alava Correlations in bipartite collaboration net- 
works, J. Stat.Mech. (2006), POIOIO. 

[14] J.J. Ramasco, S.N. Dorogovtzev and R. Pastor-Satorras, Self- 
organization of collaboration networks, Phys. Rev. E 70 (2004), 036106. 

[15] J. Scott, Social Networks Analysis: A Handbook, 2nd ed.. Sage, London, 2000. 

[16] S.H. Strogatz, Exploring complex networks. Nature 410 (2001), 268-276. 

[17] S. Wasserman and K. Faust, Social Networks Analysis, Cambridge Univer- 
sity Press, Cambridge, 1994. 

[18] Watts, D.J., Strogatz, S.H., "Collective dynamics of small- world net- 
works", Nature 393, 440-442 (1998). 



21 



